Efficient Bounds for the Softmax Function and Applications to Approximate Inference in Hybrid models

نویسنده

  • Guillaume Bouchard
چکیده

The softmax link is used in many probabilistic model dealing with both discrete and continuous data. However, efficient Bayesian inference for this type of model is still an open problem due to the lack of efficient upper bound for the sum of exponentials. We propose three different bounds for this function and study their approximation properties. We give a direct application to the Bayesian treatment of multiclass logistic regression and discuss its generalization to deterministic approximate inference in hybrid probabilistic graphical models. The softmax function is the extension of the sigmoid function for more than two values. Its role is of central importance in many non-linear probabilistic models. In particular, many well-known models deal with discrete and continuous data. Variational approximations based on the minimization of the Kullback-Leibler divergence are one of the most popular tools in large-scale Bayesian inference. In recent years, generic tools such as VIBES [1] have been proposed for inference and learning of graphical models using mean field approximations. For graphs having discrete nodes with continuous parents, the direct mean field cannot be applied, since there is no conjugate family for the multinomial logistic model. Local variational approximations have been proposed in the case of binary variables [2]. They are based on a quadratic lower bound for the log of the sigmoid function. This is used in directed probabilistic graphical models having binary variables with continuous parents [3, 1]. However, for more than 2 categories, the inference problem remains unsolved. In section 2, give an detailed overview of the techniques used by different authors to deal with Bayesian inference for models involving softmax links. In section 3, we derive three simple lower bounds and discuss their approximation properties. In Section 4 we perform numerical experiments on a typical application which is the variational approximation of the multiclass logistic regression. Finally, we discuss its application to more general deterministic approximate inference algorithms in graphical models.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The use of wavelet - artificial neural network and adaptive neuro fuzzy inference system models to predict monthly precipitation

Precipitation forecasting due to its random nature in space and time always faced with many problems and this uncertainty reduces the validity of the forecasting model. Nowadays nonlinear networks as intelligent systems to predict such complex phenomena are widely used. One of the methods that have been considered in recent years in the fields of hydrology is use of wavelet transform as a moder...

متن کامل

Determining the Optimal Value Bounds of the Objective Function in Interval Quadratic Programming Problem with Unrestricted Variables in Sign

In the most real-world applications, the parameters of the problem are not well understood. This is caused the problem data to be uncertain and indicated with intervals. Interval mathematical models include interval linear programming and interval nonlinear programming problems.A model of interval nonlinear programming problems for decision making based on uncertainty is interval quadratic prog...

متن کامل

ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM OPTIMIZATION USING PSO FOR PREDICTING SEDIMENT TRANSPORT IN SEWERS

The flow in sewers is a complete three phase flow (air, water and sediment). The mechanism of sediment transport in sewers is very important. In other words, the passing flow must able to wash deposited sediments and the design should be done in an economic and optimized way. In this study, the sediment transport process in sewers is simulated using a hybrid model. In other words, using the Ada...

متن کامل

Accurate Inference for the Mean of the Poisson-Exponential Distribution

Although the random sum distribution has been well-studied in probability theory, inference for the mean of such distribution is very limited in the literature. In this paper, two approaches are proposed to obtain inference for the mean of the Poisson-Exponential distribution. Both proposed approaches require the log-likelihood function of the Poisson-Exponential distribution, but the exact for...

متن کامل

Inference for the Type-II Generalized Logistic Distribution with Progressive Hybrid Censoring

This article presents the analysis of the Type-II hybrid progressively censored data when the lifetime distributions of the items follow Type-II generalized logistic distribution. Maximum likelihood estimators (MLEs) are investigated for estimating the location and scale parameters. It is observed that the MLEs can not be obtained in explicit forms. We provide the approximate maximum likelihood...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007